<110> APPLICANT: SEKISUI CHEMICAL CO., LTD.
      MARINE BIOTECHNOLOGY INSTITUTE CO., LTD.
      IDENO, Akira
      MARUYAMA, Tadashi
      FURUTANI, Masahiro

<120> TITLE OF INVENTION: EXPRESSION VECTOR, HOST, FUSED PROTEIN, PROCESS FOR PRODUCING
      FUSED PROTEIN AND PROCESS FOR PRODUCING PROTEIN

<130> FILE REFERENCE: Q83564

<140> CURRENT APPLICATION NUMBER: US/10/511,098
<141> CURRENT FILING DATE: 2004-10-14

<150> PRIOR APPLICATION NUMBER: PCT/JP2003/008020
<151> PRIOR FILING DATE: 2003-06-25

<150> PRIOR APPLICATION NUMBER: JP 2002-185020
<151> PRIOR FILING DATE: 2002-06-25

<160> NUMBER OF SEQ ID NOS: 30

<170> SOFTWARE: PatentIn version 3.3

<210> SEQ ID NO: 1
<211> LENGTH: 257
<212> TYPE: PRT
<213> ORGANISM: Pyrococcus horikoshii

<400> SEQUENCE: 1

Met Lys Val Glu Arg Gly Asp Val Ile Arg Leu His Tyr Thr Gly Arg
1               5                   10                  15

Val Lys Glu Thr Gly Gln Ile Phe Asp Thr Thr Tyr Glu Glu Val Ala
            20                  25                  30

Lys Glu Ala Gly Ile Tyr Asn Pro Lys Gly Ile Tyr Gly Pro Val Pro
        35                  40                  45

Ile Ile Val Gly Ala Gly His Val Ile Ser Gly Leu Asp Lys Arg Leu
    50                  55                  60

Val Gly Leu Glu Val Gly Lys Lys Tyr Thr Leu Glu Val Pro Pro Glu
65                  70                  75                  80

Glu Gly Phe Gly Leu Arg Asp Pro Lys Leu Ile Lys Val Phe Thr Met
                85                  90                  95

Gly Gln Phe Arg Lys Gln Gly Ile Val Pro Phe Pro Gly Leu Glu Val
            100                 105                 110

Glu Val Thr Thr Asp Asn Gly Arg Lys Met Lys Gly Arg Val Ile Thr
        115                 120                 125

Val Ser Gly Gly Arg Val Arg Val Asp Phe Asn His Pro Leu Ala Gly
    130                 135                 140

Lys Thr Leu Ile Tyr Glu Val Glu Ile Val Glu Lys Ile Glu Asp Pro
145                 150                 155                 160

Ile Glu Lys Ile Lys Ala Leu Ile Glu Leu Arg Leu Pro Met Ile Asp
                165                 170                 175

Arg Asp Lys Val Ile Ile Glu Val Gly Glu Lys Asp Val Lys Val Asn
            180                 185                 190

Phe Gly Glu Gln Asp Val Asp Pro Lys Thr Leu Ile Leu Gly Glu Ile
        195                 200                 205

Leu Leu Glu Ser Asp Ile Lys Phe Leu Gly Tyr Glu Lys Val Glu Phe
    210                 215                 220

Lys Pro Ser Val Glu Glu Leu Leu Arg Pro Lys Gln Glu Glu Pro Val
225                 230                 235                 240

Glu Glu Glu Lys Lys Glu Glu Gln Glu Glu Ser Glu Glu Ala Gln Ser
                245                 250                 255

Ser

<210> SEQ ID NO: 2
<211> LENGTH: 157
<212> TYPE: PRT
<213> ORGANISM: Methanococcus jannaschii

<400> SEQUENCE: 2

Leu Ile Asn Leu Ile Lys Lys Gly Asp Tyr Val Lys Val Asp Tyr Ile
1               5                   10                  15

Leu Glu Val Asp Gly Lys Val Ile Asp Thr Ser Ile Glu Glu Val Ala
            20                  25                  30

Lys Glu Asn Lys Ile Tyr Tyr Pro Glu Arg Glu Tyr Glu Pro Ile Gly
        35                  40                  45

Phe Ile Val Gly Asn Gly Glu Leu Ile Glu Gly Phe Glu Glu Ala Val
    50                  55                  60

Ile Gly Met Glu Val Gly Glu Glu Lys Thr Val Thr Ile Pro Pro Glu
65                  70                  75                  80

Lys Gly Tyr Gly Leu Arg Asp Glu Arg Leu Ile Gln Glu Ile Pro Lys
                85                  90                  95

Glu Met Phe Ala Asp Ala Asp Phe Glu Pro Gln Glu Gly Met Leu Ile
            100                 105                 110

Leu Ala Ser Gly Ile Pro Ala Lys Ile Ile Lys Val Thr Asp Asp Thr
        115                 120                 125

Val Thr Leu Asp Phe Asn His Glu Leu Ala Gly Lys Glu Leu Lys Phe
    130                 135                 140

Thr Ile Lys Val Arg Asp Val Gln Pro Ala Glu Ser Glu
145                 150                 155

<210> SEQ ID NO: 3
<211> LENGTH: 432
<212> TYPE: PRT
<213> ORGANISM: Escherichia coli

<400> SEQUENCE: 3

Met Gln Val Ser Val Glu Thr Thr Gln Gly Leu Gly Arg Arg Val Thr
1               5                   10                  15

Ile Thr Ile Ala Ala Asp Ser Ile Glu Thr Ala Val Lys Ser Glu Leu
            20                  25                  30

Val Asn Val Ala Lys Lys Val Arg Ile Asp Gly Phe Arg Lys Gly Lys
        35                  40                  45

Val Pro Met Asn Ile Val Ala Gln Arg Tyr Gly Ala Ser Val Arg Gln
    50                  55                  60

Asp Val Leu Gly Asp Leu Met Ser Arg Asn Phe Ile Asp Ala Ile Ile
65                  70                  75                  80

Lys Glu Lys Ile Asn Pro Ala Gly Ala Pro Thr Tyr Val Pro Gly Glu
                85                  90                  95

Tyr Lys Leu Gly Glu Asp Phe Thr Tyr Ser Val Glu Phe Glu Val Tyr
            100                 105                 110

Pro Glu Val Glu Leu Gln Gly Leu Glu Ala Ile Glu Val Glu Lys Pro
        115                 120                 125

Ile Val Glu Val Thr Asp Ala Asp Val Asp Gly Met Leu Asp Thr Leu
    130                 135                 140

Arg Lys Gln Gln Ala Thr Trp Lys Glu Lys Asp Gly Ala Val Glu Ala
145                 150                 155                 160

Glu Asp Arg Val Thr Ile Asp Phe Thr Gly Ser Val Asp Gly Glu Glu
                165                 170                 175

Phe Glu Gly Gly Lys Ala Ser Asp Phe Val Leu Ala Met Gly Gln Gly
            180                 185                 190

Arg Met Ile Pro Gly Phe Glu Asp Gly Ile Lys Gly His Lys Ala Gly
        195                 200                 205

Glu Glu Phe Thr Ile Asp Val Thr Phe Pro Glu Glu Tyr His Ala Glu
    210                 215                 220

Asn Leu Lys Gly Lys Ala Ala Lys Phe Ala Ile Asn Leu Lys Lys Val
225                 230                 235                 240

Glu Glu Arg Glu Leu Pro Glu Leu Thr Ala Glu Phe Ile Lys Arg Phe
                245                 250                 255

Gly Val Glu Asp Gly Ser Val Glu Gly Leu Arg Ala Glu Val Arg Lys
            260                 265                 270

Asn Met Glu Arg Glu Leu Lys Ser Ala Ile Arg Asn Arg Val Lys Ser
        275                 280                 285

Gln Ala Ile Glu Gly Leu Val Lys Ala Asn Asp Ile Asp Val Pro Ala
    290                 295                 300

Ala Leu Ile Asp Ser Glu Ile Asp Val Leu Arg Arg Gln Ala Ala Gln
305                 310                 315                 320

Arg Phe Gly Gly Asn Glu Lys Gln Ala Leu Glu Leu Pro Arg Glu Leu
                325                 330                 335

Phe Glu Glu Gln Ala Lys Arg Arg Val Val Val Gly Leu Leu Leu Gly
            340                 345                 350

Glu Val Ile Arg Thr Asn Glu Leu Lys Ala Asp Glu Glu Arg Val Lys
        355                 360                 365

Gly Leu Ile Glu Glu Met Ala Ser Ala Tyr Glu Asp Pro Lys Glu Val
    370                 375                 380

Ile Glu Phe Tyr Ser Lys Asn Lys Glu Leu Met Asp Asn Met Arg Asn
385                 390                 395                 400

Val Ala Leu Glu Glu Gln Ala Val Glu Ala Val Leu Ala Lys Ala Lys
                405                 410                 415

Val Thr Glu Lys Glu Thr Thr Phe Asn Glu Leu Met Asn Gln Gln Ala
            420                 425                 430

<210> SEQ ID NO: 4
<211> LENGTH: 1299
<212> TYPE: DNA
<213> ORGANISM: Escherichia coli

<400> SEQUENCE: 4

atgcaagttt cagttgaaac cactcaaggc cttggccgcc gtgtaacgat tactatcgct     60
gctgacagca tcgagaccgc tgttaaaagc gagctggtca acgttgcgaa aaaagtacgt    120
attgacggct tccgcaaagg caaagtgcca atgaatatcg ttgctcagcg ttatggcgcg    180
tctgtacgcc aggacgttct gggtgacctg atgagccgta acttcattga cgccatcatt    240
aaagaaaaaa tcaatccggc tggcgcaccg acttatgttc cgggcgaata caagctgggt    300
gaagacttca cttactctgt agagtttgaa gtttatccgg aagttgaact gcagggtctg    360
gaagcgatcg aagttgaaaa accgatcgtt gaagtgaccg acgctgacgt tgacggcatg    420
ctggatactc tgcgtaaaca gcaggcgacc tggaaagaaa aagacggcgc tgttgaagca    480
gaagaccgcg taaccatcga cttcaccggt tctgtagacg gcgaagagtt cgaaggcggt    540
aaagcgtctg atttcgtact ggcgatgggc cagggtcgta tgatcccggg ctttgaagac    600
ggtatcaaag gccacaaagc tggcgaagag ttcaccatcg acgtgacctt cccggaagaa    660
taccacgcag aaaacctgaa aggtaaagca gcgaaattcg ctatcaacct gaagaaagtt    720
gaagagcgtg aactgccgga actgactgca gaattcatca aacgtttcgg cgttgaagat    780
ggttccgtag aaggtctgcg cgctgaagtg cgtaaaaaca tggagcgcga gctgaagagc    840
gccatccgta accgcgttaa gtctcaggcg atcgaaggtc tggtaaaagc taacgacatc    900
gacgtaccgg ctgcgctgat cgacagcgaa atcgacgttc tgcgtcgcca ggctgcacag    960
cgtttcggtg gcaacgaaaa acaagctctg gaactgccgc gcgaactgtt cgaagaacag   1020
gctaaacgcc gcgtagttgt tggcctgctg ctgggcgaag ttatccgcac caacgagctg   1080
aaagctgacg aagagcgcgt gaaaggcctg atcgaagaga tggcttctgc gtacgaagat   1140
ccgaaagaag ttatcgagtt ctacagcaaa aacaaagaac tgatggacaa catgcgcaat   1200
gttgctctgg aagaacaggc tgttgaagct gtactggcga aagcgaaagt gactgaaaaa   1260
gaaaccactt tcaacgagct gatgaaccag caggcgtaa                          1299

<210> SEQ ID NO: 5
<211> LENGTH: 270
<212> TYPE: PRT
<213> ORGANISM: Escherichia coli

<400> SEQUENCE: 5

Met Lys Ser Leu Phe Lys Val Thr Leu Leu Ala Thr Thr Met Ala Val
1               5                   10                  15

Ala Leu His Ala Pro Ile Thr Phe Ala Ala Glu Ala Ala Lys Pro Ala
            20                  25                  30

Thr Ala Ala Asp Ser Lys Ala Ala Phe Lys Asn Asp Asp Gln Lys Ser
        35                  40                  45

Ala Tyr Ala Leu Gly Ala Ser Leu Gly Arg Tyr Met Glu Asn Ser Leu
    50                  55                  60

Lys Glu Gln Glu Lys Leu Gly Ile Lys Leu Asp Lys Asp Gln Leu Ile
65                  70                  75                  80

Ala Gly Val Gln Asp Ala Phe Ala Asp Lys Ser Lys Leu Ser Asp Gln
                85                  90                  95

Glu Ile Glu Gln Thr Leu Gln Ala Phe Glu Ala Arg Val Lys Ser Ser
            100                 105                 110

Ala Gln Ala Lys Met Glu Lys Asp Ala Ala Asp Asn Glu Ala Lys Gly
        115                 120                 125

Lys Glu Tyr Arg Glu Lys Phe Ala Lys Glu Lys Gly Val Lys Thr Ser
    130                 135                 140

Ser Thr Gly Leu Val Tyr Gln Val Val Glu Ala Gly Lys Gly Glu Ala
145                 150                 155                 160

Pro Lys Asp Ser Asp Thr Val Val Val Asn Tyr Lys Gly Thr Leu Ile
                165                 170                 175

Asp Gly Lys Glu Phe Asp Asn Ser Tyr Thr Arg Gly Glu Pro Leu Ser
            180                 185                 190

Phe Arg Leu Asp Gly Val Ile Pro Gly Trp Thr Glu Gly Leu Lys Asn
        195                 200                 205

Ile Lys Lys Gly Gly Lys Ile Lys Leu Val Ile Pro Pro Glu Leu Ala
    210                 215                 220

Tyr Gly Lys Ala Gly Val Pro Gly Ile Pro Pro Asn Ser Thr Leu Val
225                 230                 235                 240

Phe Asp Val Glu Leu Leu Asp Val Lys Pro Ala Pro Lys Ala Asp Ala
                245                 250                 255

Lys Pro Glu Ala Asp Ala Lys Ala Ala Asp Ser Ala Lys Lys
            260                 265                 270

<210> SEQ ID NO: 6
<211> LENGTH: 813
<212> TYPE: DNA
<213> ORGANISM: Escherichia coli

<400> SEQUENCE: 6

atgaaatcac tgtttaaagt aacgctgctg gcgaccacaa tggccgttgc cctgcatgca     60
ccaatcactt ttgctgctga agctgcaaaa cctgctacaa ctgctgacag caaagcagcg    120
ttcaaaaatg acgatcagaa atcagcttat gcactgggtg cttcgctggg tcgttacatg    180
gaaaactctc ttaaagaaca agaaaaactg ggcatcaaac tggataaaga tcagctgatc    240
gctggtgttc aggatgcatt tgctgataag agcaaacttt ccgaccaaga gatcgaacag    300
actctgcaag cattcgaagc tcgcgtgaag tcttctgctc aggcgaagat ggaaaaagac    360
gcggctgata acgaagcaaa aggtaaagag taccgcgaga aatttgccaa agagaaaggt    420
gtgaaaacct cttccactgg tctggtttat caggtagtag aagccggtaa aggcgaagcc    480
ccgaaagaca gcgatactgt tgtagtgaac tacaaaggta cgctgatcga cggtaaagag    540
ttcgacaact cttacacccg tggtgaaccg ctctctttcc gtctggacgg tgttatcccg    600
ggttggacag aaggtctgaa gaacatcaag aaaggcggta agatcaaact ggttattcca    660
ccagaactgg cttacggcaa agcgggtgtt ccggggatcc caccgaattc taccctggtg    720
tttgacgtag agctgctgga tgtgaaacca gcgccgaagg ctgatgcaaa gccggaagct    780
gatgcgaaag ccgcagactc tgctaaaaaa taa                                 813

<210> SEQ ID NO: 7
<211> LENGTH: 428
<212> TYPE: PRT
<213> ORGANISM: Escherichia coli

<400> SEQUENCE: 7

Met Lys Asn Trp Lys Thr Leu Leu Leu Gly Ile Ala Met Ile Ala Asn
1               5                   10                  15

Thr Ser Phe Ala Ala Pro Gln Val Val Asp Lys Val Ala Ala Val Val
            20                  25                  30

Asn Asn Gly Val Val Leu Glu Ser Asp Val Asp Gly Leu Met Gln Ser
        35                  40                  45

Val Lys Leu Asn Ala Ala Gln Ala Arg Gln Gln Leu Pro Asp Asp Ala
    50                  55                  60

Thr Leu Arg His Gln Ile Met Glu Arg Leu Ile Met Asp Gln Ile Ile
65                  70                  75                  80

Leu Gln Met Gly Gln Lys Met Gly Val Lys Ile Ser Asp Glu Gln Leu